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Abstract. We consider the problem of sorting elements on a series of stacks, introduced by Tarjan and Knuth. We 
improve the asymptotic lower bound for the number of stacks necessary to sort n elements to 0.561 log 2 n + O(l). 
This is the first significant improvement since the previous lower bound, i log 2 n + O ( 1 ) , was established by Knuth 
in 1972. 



1 Introduction 

The subject of our paper is a mathematical puzzle or game for one player. The game is played with 
n elements labelled from 1 to n on a system of k stacks (also labelled consecutively) arranged in 
series, with a queue at either end. The queue at one end is called the input queue and the queue 
at the other end is the output queue. The game begins with all stacks and queue empty, except for 
the input queue, which contains the elements (from front to back) cr(l), cr(2), . . . , cr(n) for some 
arbitrary permutation a £ S n . The player has the following legal moves: 

1. Dequeue an element from the front of the input queue and push it onto stack 1, 

2. Pop an element from stack % and push it onto stack % + 1, for 1 < i < k, 

3. Pop an element from stack k and enqueue it on the output queue. 

That is, the player may move an element from one stack or queue to the next. Eventually all the 
elements will reach the output queue, so there are no more legal moves and the game ends. The 
player wins if, at the end of the game, the elements in the output queue are in sorted order. 

The problem of sorting elements with a single stack was introduced by Knuth in [5], which in- 
cludes several exercises about stack sorting. For instance, exercise 2.2.1.5 (p. 239) asks the reader 
to show that n £ S n cannot be sorted with one stack if and only if there exist indices i < j < k 
such that 7r(j) < ir(k) < 7r(i). Another exercise shows that a single stack can sort exactly ^-j- ( 2 ™) 
permutations. Together these exercises comprise a founding result in the field of permutation pat- 
terns. 

Tarjan generalized stack sorting to networks of stacks and queues in [7], including systems of 
stacks arranged in series. Tarjan showed that a system of k stacks in series can sort 3-2 fc_1 elements, 
and cited []6|| for a result that k stacks cannot sort A k elements. Knuth revisited stack sorting in j6l 
and gave two exercises on series of stacks. The exercises concern the minimum of stacks required 
to sort any permutation of n elements, which we denote k n . Exercise 5.2.4.20 (p. 170) asks for the 
rate of growth of k n as a function of n. Knuth lists this exercise as an open problem, and it remains 
an open problem today. Furthermore, there has been no improvement on the elementary bounds 

i log 2 n + 0(1) < k n < log 2 n + 0(1) 



given in [6]. See Q~| for an excellent survey on stack sorting. 
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Table 1. Values of k n for small n. 



The main result in this paper is a better lower bound for k n . We will start by re-proving Knuth's 
lower bound, k n >\ log 2 n + 0(1). Then we will improve the lower bound to 0.513 log 2 n + 0(1) 
by counting strings of moves, and explore several variations on this approach. Finally, we combine 
several techniques and obtain a bound of k n > 0.561 log 2 n + 0(1) with the aid of a computer. 

2 Notation and Definitions 

Let us start by introducing some formal notation to describe the game. Define the current state of 
the game to be a description of the elements in each stack and queue (for instance, a list of elements 
from bottom to top for each stack and front to back for each queue). Let & k be the set of all possible 
states for a system of k stacks containing finitely many elements, as well as an additional symbol 
to indicate an illegal state. We say a state is initial if all the elements are in the input queue, and 
a state is final if all the elements are in the output queue. Let I(n, k) be the initial state such that 
the elements are in the input queue in sorted order. Similarly, let F(n, k) be the final state with 
the elements in the output queue in sorted order. Note that we win the game if we finish in state 
F(n, k). 

A move is some action by the player that changes the state. We let mi denote the move that 
pushes an element onto stack i for each 1 < % < k, and let m fc+1 be the move that pushes to 
the output queue, so the complete set of moves is M k = {mi, . . . ,m fe+1 }. Let Mjt denote the 
free monoid on M k , or (abusing notation slightly) the set of finite strings over M k . We adopt 
the following common notation for string-related concepts. Given u,v E Ml, let uv denote their 
concatenation, let \u\ denote the length of u and let \u\ i be the number of occurrences of mi in u. 
Note that we will sometimes write a long sequence of moves m^m^ ■ ■ ■ in the more compact 
notation m ili2 ...i.. 

The relationship between moves and states is described by a monoid action * : Mjt x & k — > & k , 
where sequences of moves act on states. Suppose w G Ml is a sequence of moves and s G & k is a 
state. Define w * s to be the state obtained by starting with state s and performing moves from the 
sequence w one at a time. Define w * := for all w and w * s := if some move in w is illegal 
when we apply the sequence to s. 

2.1 Sortable Permutations 

Suppose we have some 7r e S n , let s w € & k be the initial state with input queue (from front to 
back) 7r(l), . . . , 7r(n) and let t w e & k be the final state with output queue 7r(l), . . . , n(n). We say 
7r is sortable by k stacks if there exists a sequence of moves w G Ml such that w * s w = F(n, k). 
Similarly, we say n is generated by k stacks if there exists a sequence of moves w G Ml such that 

w * I(n, k) = t w . 

Definition 1. Let G(n, k) C S n be the set of permutations generated by k stacks. 



Fig. 1. Sorting the permutation (4231) w ' tn ^ stacks using the sequence 771121121232333 (broken into 7 steps mi2, mi, 77112, 11123, 
77123, m 3 , m 3 ). 
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Recall that our goal is to study the number of stacks necessary to sort n elements. Clearly we can 
sort all permutations if and only if we can generate all permutations (that is, if G(n, k) = S n ). 

Definition 2. For each n > 0, let k n denote the smallest integer such that G(n,k n ) = S n . 

In other words, k n is the number of stacks required to sort n elements. The sequence {k i ]°l l tells 
us precisely when G(n, k) = S n according to the following proposition. 

Proposition 1. For all n, k > 0, we can sort any permutation of n elements on k stacks (that is, 
G(n, k) = S n ) if and only ifk > k n . 

Proof. When k < k n , the result holds by the definition of k n . When k > fc„, we can sort any 
permutation by first moving all the elements to stack k — k n and then using the remaining k n stacks 
to actually sort them. 



2.2 Complete Strings 

We will study sortable permutations by studying strings of moves. Naturally, we are interested in 
the set of strings that generate permutations. This motivates the following definitions. 



Definition 3. We say a string w G Ml is complete if w * s is a final state for some initial state 
s <E & k . We say w is n-complete ifw * I(n, k) is a final state. Let £/(n, k) C Ml be the set of all 
n-complete strings. 

The following lemma characterizes n-complete strings. 
Lemma 1. Let w G Mj*. Then the following are equivalent: 

1. w belongs to the set £/(n, k). 

2. w can be partitioned into n subsequences, all equal to • ■ • m k+1 . 

3. \u\ 1 > \u\ 2 > • ■ ■ > \u\ k+1 > Ofor all prefixes u ofw and = • • • = = n. 

Proof. 

(1) =► (2) 

Given an n-complete sequence w G M£, we can apply w to the initial state I(n,k). Each 
symbol in w manipulates a single element when we apply it to a state. We may partition the 
symbols in w into n subsequences, where each subsequence consists of all moves that manip- 
ulate a chosen element. Each element travels from the input stack to the output stack, so the 
corresponding subsequence is mim 2 • • • m k+1 . 

(2) =► (3) 

Since w is composed of n copies of miro 2 • • • m k+ i, we have \w\ i = n for all i. Similarly, any 
prefix u of w is composed of n prefixes of mim 2 ■ ■ ■ m k+1 , so it is clear that n>\u\ 1 > \u\ 2 > 

• • • > > o. ' ' " ' " 

(3) =► (1) 

The fact that \u\ 1 > \u\ 2 > ■ ■ > \u\ k+1 for all prefixes u of w ensures that, when we apply 
w, we never attempt to pop from an empty stack. The sequence w dequeues n elements from 
the input queue since \w\ 1 = n, so w is a legal sequence of moves on I(n, k). Furthermore, 
applying w to I(n, k) moves all the elements to the last stack, since \w\-y = ■ ■ ■ = |iu| fc+1 = n. 
Hence, w e Ci(n, k). 

This completes the proof. 

We note that any string in Ci(n, k) contains rrii exactly n times, and therefore has length n(k + 1). 
Hence, we also define 

£u(n, k) := {ue Ml : \u\ 1 = ■■■ = \u\ k+1 = n} 
C m (n, k) := {u G M* k : \u\ = n(k + 1)}, 

and note that Ci(n, k) C £ n (^, k) C C m (n, k) for all n, k > 0. We call the three languages type 
I, type II and type III respectively. In many cases, Cu(n, k) and C m (n, k) are easier to work with 
than £i(n, k), and one can show that the three languages have the same number of strings up to a 
polynomial factor. 

3 Known Lower Bound 

In this section we will prove Knuth's lower bound for k n , but we need a lemma first. Define the 
product of two sets of permutations, A,BeS n as AB := {ab : a G A,b G B}. 



Lemma 2. For all n > and k > 0, we have G(n, k) = G(n, l) k = {tti o . . . o ir k : 7Ti, . . . , tt^ G 
G(n,l)}. 

Proof. Suppose we have a string w G £;), and we apply it to an initial state with n elements. 
Consider the ith stack and note that the elements enter in the order x\, . . . , x n and leave in the 
order n^xi), . . . , 7Tj(x n ), for some 7Tj G G(n, 1). Then the permutation generated by the system is 
clearly the composition n k o • • • o m G G(n, l) h , so G(n, k) C G(n, l) k . 

In the other direction, let tti, . . . ,n k be permutations in G(n, 1). There exist strings W\ G 
{m 1 ,m 2 }*, w 2 G {m 2 ,m 3 }*, w k G {m fc ,m fc+1 }* such that stack 2 generates 7T, when we 
apply Wi, for each I < i < k. We would like to find w G k) such that is the subsequence 
of moves in w that manipulate stack i, for each i. If we can find such a string, then it must generate 
Tc k o • ■ ■ o 7Ti, and therefore show that G(n, l) k C /c). 

Let us start by numbering the symbols in w±, . . . , w k so that the jth occurrence of m; is written 
Define a relation -< such that mP 1 ^ -< if m,-f is before raf„ in some we, and then close 
-< under transitivity to obtain a partial order (we leave it as an exercise to show that this can be 
done). Now we extend -< to a total order and list the symbols in sorted order to obtain w G A(n, k). 

Now that we have Lemma[2l we can prove Knuth's lower bound for k n . This is not a new result, 
but we will use the same ideas for later theorems, so it is a useful example. 

Theorem 1. For all n,k > Owe have \G(n, k) \ < A nk ,from which it follows that \ log 2 n+0(l) < 

Proof. The lemma gives us G(n, k) = G(n, l) k and hence \G(n, k)\ = \G(n, l) k \ < \G(n, l)\ k . 
Every permutation is generated by some string in Cj(n, k), so \G(n, 1)| < |£i(n,l)| < |>Cin(n, 1)| = 
4 n . We conclude that \G(n, k) \ < A nk for all n, k > 0. 

Now recall that G(n, k n ) = S n by definition, so n\ = \G(n, k n )\ < A nkn . Taking logarithms 
gives log 2 n\ < 2nk n , and then we apply Stirling's approximation to obtain | log 2 n + 0(1) < k n . 

4 Working Towards a Better Lower Bound 

We claim that the lower bound presented in Theorem Q] can be improved. The key is to show that 
\G(n, k) | grows exponentially slower than A nk , which is the bound we use in the theorem, derived 
from the inequality \G(n, k)\ < \G(n, l)\ k . Unfortunately, \G(n, l)| fc is a poor upper bound for 
\G(n, k)\, even for small n and k. For example, |G(4,2)| = |5 4 | = 24, but |G(4, 1)| 2 = 14 2 = 196. 

Conceptually, the inequality \G(n, k)\ < \G(n, l)\ h comes from breaking the system of stacks 
into k single stack systems, and assuming those systems do not interact. The problem is that ad- 
jacent stacks do interact, and one stack often undoes work done by another stack. For instance, 
if the first stack reverses the elements and the second stack reverses them again, then we have 
accomplished nothing with those stacks. 

Now suppose that we break the system into groups of i stacks instead of single stacks, where 
£ > 1 is a constant. If we can show that \G(n, £) \ grows slower than \G(n, 1) | , then we can prove 
a better lower bound with the following proposition. 

Proposition 2. Let £ > 1 be a constant. If\G(n,£)\ G 0(b n ) then k n > + 0(1). 



Proof. We can use Lemma |2] to show that 

n\ = \G(n,k n )\ < \G(n,£y kn/e ^ \ < \G(n,£)\ kn/l+1 < c b n{kn/e+1) . 

Taking logarithms and using Stirling's approximation gives 

nlog 2 n + 0{n) < log 2 [ c b n ^ /e+1) ] 
log 2 n + 0(l) < (k n /£)\og 2 b 

p^ + o(i)< kn , 

log 2 o 

completing the proof. 

In the case i = 1 with \G(n, 1)| G 0(4") we get |log 2 n + 0(1) < k n , which is the lower 
bound from Theorem [Q In this section we will focus on £ = 2 since it is the easiest case after 
£ = 1. Unfortunately, we cannot use \Ci(n, 2) | as a bound for \G(n, 2) | (as we did for the one stack 
case) since \£>i(n, 2)| ~ |£m(n, 2)| = 27 n > 16™. Evidently the ratio of n-complete strings to 
generate permutations, |^'^'^| , is about (27/16) n . Contrast this with one stack, where it turns out 

that |A(M)| = |G(n,2)|. ' 

In the next section, we define an equivalence relation that expresses when two strings generate 
the same permutation. We will show that we can rewrite strings in \Cm(n, 2)| to a (more or less) 
canonical string in the same equivalence class. Then we will count only the canonical strings to 
bound \G(n, 2) | and obtain a better lower bound for k 2 . 

4.1 Equivalence of Strings 

Recall that Ml is a monoid that acts on game states &k via *. Let us define an equivalence relation 
~ on Ml such that x ~ y if x * s = y * s for all s £ &k- In other words, two strings x and y are 
equivalent if they act on every state in exactly the same way. 
We note three properties of ~ without proof. 

Proposition 3. The relation ~ is a congruence relation. That is, ifu ~ v and x ~ y then ux ~ v y. 

Proposition 4. Suppose u, v G Ml are strings. Then u*s = v*sj^0 for some s G if and 
only ifu^v. 

Lemma 3. For strings a, b,u,v G Ml, we have aub ~ avb if and only ifu ~ v. 

For example, every string in A(2, 2) generates a permutation in G(2, 2). Therefore ~ splits 
£i(2, 2) into two congruence classes: m 123 i 23 ~ m 12 i 323 ~ mn 2233 and mn 2323 ~ m m233 . Using 
cancellation, we can deduce a set of relations, R := {m 13 ~ m 31 , m 1223 ~ m 2312 , m 1232 ~ m 2123 }. 

The first relation, mi 3 ~ m 3i , says that performing move mi followed by m 3 is indistin- 
guishable from performing m 3 and then mi. This makes sense, since mi affects the input queue 
and stack 1, while m 3 affects stack 2 and the output queue. Since mi and m 3 operate on differ- 
ent pieces of the system, they should commute. The other two relations, m 12 m 2 3 ~ ^23^12 an d 
Tni23m 2 ~ m 2 mi 23 are also commutativity relations uv ~ v u for strings u, v G M 2 that operate 
on different parts of the system. 

We can generate a congruence relation ~# (based on the relations in R) such that m i3 ~# 
^31, ^1223 ~i? "^2312, ^1232 ~_r ^2123- It follows that for u, v G M 2 , if u v then u ~ v. In 
other words, ~ R is finer relation than ~. One can show that m i22233 ~ m 2233 i 2 but mi 22233 ^ R 
^223312, so ~ R is strictly finer than ~. 



4.2 String Rewriting 

Recall that every string in Ci(n, 2) generates a permutation. Since two strings u,v E M% generate 
the same permutation if and only if u ~ v, the congruence classes of Ci(n, 2) correspond exactly 
to permutations in G(n, 2). We would like to find a set of representative strings, one for each 
equivalence class of M|. Unfortunately, we do not know how to do this for ~. Instead we will find 
a set of representatives V for the equivalence classes of M| under the relation 

Let V contain the lexicographically maximal string from each equivalence class of Mf . Sup- 
pose w is a string in V containing m 13 as a substring. Then we can write w = am 13 b for some 
a, b e M| and note that w' := am 3 ib is lexicographically larger than w. But w' is in the same 
equivalence class as w since w ~# 77/ , so we have a contradiction. Therefore all strings in V avoid 
having m 13 as a substring. There is a similar argument for the other relations in R, so strings in V 
avoid mi223 and 772,1232 as well. 

Define U C Mjt to be the set of strings that do not contain mi 3 , 777,1223 or 772,1232 as substrings. 
We have just shown that V CU.lt turns out that U. = V, but this is more than we need, so we omit 
the proojj. 

4.3 Enumerating Strings in U 

Let 7r be an arbitrary permutation in G(n, 2), generated by some string w in £i(n, 2). This string 
has a representative w' in V C U such that 77; ~# 77/ and hence 7// also generates G(n, 2). It follows 
that w' is also in Ci(n, 2) C £in(n, 2). Thus, any permutation in G(77, 2) is generated by a string 
in the set Cm(n, 2) D U. Each string in Cm(n, 2) DU can generate at most one permutation, so we 
have \G(n, 2)| < |£ ra (r7, 2) n U\. Note that C m (n, 2) is the set of strings in M 2 * of length 377, so 
if we can find a generating function U (x) that counts strings in hi (weighted by the length of the 
string), then 



There are several published techniques for counting strings avoiding some finite set of for- 
bidden substrings. For example, there is a technique using systems of equations by Guibas and 
Odlyzko 01, or the Goulden-Jackson cluster method in H2I3L Using either method, the generating 
function for U (x) is 



|G(n,2)| < |£in(n,2) nW| = [x 3n ]U(x). 



K 1 1 - 3x + x 2 + 2x 4 ' 
The denominator has distinct roots, so the partial fraction decomposition is 



1 




where ci, . . . , c 4 , Ai, . . . , A 4 e C, with Ai, . . . , A 4 are the roots of 1 — 3x + x 2 + 2x 4 : 



Ai = 0.40671 
A 2 = 0.77626 



A 3 = -0.59149 + 1.11087 
A 4 = -0.59149 - 1.1108?;. 



1 The proof amounts to showing that the rewriting rules mis — > m3i, m-1223 — > 1T12312, m-1232 — > 771,2123 form a confluent 
system. 



Each of the four terms is a geometric series and it is not difficult to show that [^"Ix^ * s m 
6>(|A,r n ) for all i. The first term dominates, so [x n ]U(x) is in GQX^'") = 0(2.45875"). It 
follows that 

|G(n,2)| < |£rn(n,2) n«| 
= [x 3n }U(x) 
e 0(14.864 n ). 

We can now prove our first improvement to the lower bound: 
Theorem 2. For all n, 

k n > ; T~/\~liaA + °W = °- 51364 lo §2 " + 0(1). 

log 2 14.864 

Proof. Apply Proposition[2]to the bound \G(n,2)\ < [x 3n }U(x) G 0(14.864"). 



5 Further Improvements 

In this section we discuss techniques to further improve our lower bound in the previous section. 
These techniques give us the lower bounds k n > 0.52224 log 2 n + 0(1) and k n > 0.53028 log 2 n + 
0(1) respectively. Then we combine these techniques with a large set of relations, i? 16 , to obtain 
our final bound, 



k n > 0.561 log 2 n + 0(1). 



5.1 Additional Relations 

For our original lower bound, we used the three relations m 13 ~# m 31 , m 1223 ~ J? ^2312 and 
^1232 ~r ^2123- Recall that the congruence relation ~^ is strictly finer than ~, so there exist 
elements u, v that are equivalent under ~ but not under By adding more relations to R, we 
reduce the number of equivalence classes, and therefore improve the bound on k n . 

Suppose we let R' — R U {m 112223 ~ ^231122, ^122233 ~ "^223312} be a new set of relations. 
By the same process as before, we obtain a set of strings W C M 2 avoiding mi 3 , mi223, "^1232, 
^112223 and mi22233- The corresponding generating function is 



v ' 1 - 3a; + x 2 + 2x 4 + 2x 6 

The smallest real root of the denominator is approx. 0.41278, leading to [x n ]U (x) E 0(2.4229") 
and \G(n,2)\ e 0(14.218 n ). In the end, we get the lower bound k n > 0.52224 log 2 n + 0(1). 

We see that additional relations improve our lower bound. For our final result we will use R±e, 
the set of all nontrivial relations u ~ v such that \u\ ,\v\ < 16. We use a computer to generate the 
1591 relations in -RidE The algorithm for finding relations is roughly as follows. 



2 See http : / / www . student . cs . uwaterloo .ca/~13schaef/stacksort/ for the list of relations. 



1. We consider all strings of length £, and let s be the state with t elements in the input queue and 
£ elements in each queue. 

2. Then we compute w * s for each string w E M 2 of length £, and note that w * s ^ since we 
do not have enough moves to empty a stack and pop from it. 

3. Finally, the equivalence classes of w * s (under =) correspond to the equivalence classes of w 
(under ~) by Proposition |H 

It is clear that the algorithm is computationally expensive, and although there are ways to improve 
it, the running time is unavoiably exponential in i. We draw the line at R\% because of the cost 
of computing new relations and solving the resulting equations, as well as the rapidly diminishing 
contribution (in terms of the lower bound) of each new relation. 

There are some patterns among the relations we computed, although most of the relations do 
not fit any known pattern. Consider the following relations in K 

m 12 m 23 ~ m 23 nii 2 

^1122^123 ~ m 23 m U22 
m 12 m 2233 ~ rn 2233 m l2 . 

These relations are the first three examples of an infinite set of relations of the form [mium 2 ) (m 2 vm 3 ) ~ 
(m 2 vm 3 )(mium 2 ) for u,v E [E Similarly, there are relations of the form um 2 ~ m 2 u for 
u E M2 a complete string, although not every complete string gives a useful relation. For in- 
stance, u = mi 23 mi 23 is complete, but since we can already deduce um 2 ~ m 2 u from the relation 
^123^2 ~ ^2^123, the relation um 2 ~ m 2 u is redundant. 



5.2 Weighted Generating Functions 

Recall that to prove our main result, we found a generating function U(x) for the set U, weighted 
by string length. If we use a different weight function, which sends w to x^' 1 ^' 2 ^' 3 then we get 
a multivariate generating function U(x\, x 2 , x 3 ) for hi. Then the Guibas-Odlyzko method or the 
Goulden- Jackson cluster method can be used to compute U(x\, x 2l x 3 ): 

1 

U(x 1 ,x 2 ,x 3 ) 



1 - x 1 - x 2 - x 3 + X1X3 + 2xix%x 3 ' 

We have seen that \G(n, 2)| < \C x {n, 2) n U\ < |£ n (n, 2) n U\ and hence 

|G(n,2)| < |A(n,2) nW| < |£ n (n,2) nW| = \x\x\x^J{x x , x 2 , x 3 ). 

Notice that since all the coefficients of U(xi,x 2 , x 3 ) are positive integers, if we have a ring homo- 
morphism ip : Z[[xi, £2,^3]] — > ^[[ x ]] then 

{x T {x 2 x 3 }U(x 1 ,x 2 ,x 3 ) < [v?(xi) n ^(x2)Xx 3 ) n ]^(f/(xi,X2,x 3 )). 

This allows us to give m l5 m 2 and m 3 different weights, as in the following proposition. 



3 There are additional conditions on u and v. Specifically, u must transfer elements from the input queue to stack 2, and v must 
transfer elements from stack 1 to the output queue 



Proposition 5. For all n > 0, 

|G(n,2)| < [x n(ai+Q2+a3) ] . 

Proof. Take y> to be the ring homomorphism such that x.j >->■ x a " for z = 1, 2, 3. The result follows 
easily. 

For example, take c*i = a 3 = 1 and a 2 = 2. We get 

iG(»,2)i< [ ^]( r ^i +2:c ; 

G O(13.708 n ). 

Applying Proposition [2] gives 

fc n > 0.52953 log 2 n + 0(1). 

This is quite an improvement over our first result, considering that only the analysis changed. 
Unfortunately, it is not immediately obvious how to choose values for ax, «2> 013 G N to get the 
best possible bound. 

The asymptotic behaviour of a univariate generating function is connected to its radius of con- 
vergence, and therefore to the location of its poles. Let us assume that multivariate generating 
functions are similar, and look at the zeros of the denominator. In particular, let us try to find 
Xi,x 2 , x 3 G (0, 00) positive real numbers such that our polynomial denominator 1 — xi — x 2 — 
x 3 + X1X3 + 2x1X3X3 is zero and Xix 2 x 3 is minimized (or equivalently, £ is maximized). Using 
Lagrange multipliers, the solution satisfies the following equations: 

= 1 — Xi — x 2 — x 3 + X1X3 + 2x1X2X3 

= X1X3 + A(4xiX2X3 — 1) 

= x 2 x 3 + A(x 3 + 2x 2 x 3 — 1) 

= X1X2 + A(xi + 2xix 2 — 1). 

In this case, it is possible to solve these equations exactly by hand, but in general we employ a 
computer algebra system and obtain numerical results. The optimal solution is x x = x 3 = | and 

X2 = 1 — giving — - — = 8 + 4a/2 = 13.659 as the growth rate. Therefore, we expect that 
[x"x 2 X3]£7(xi, x 2 , x 3 ) is in 0((8 + 4a/2 + e) n ) for all e > 0, with a corresponding lower bound is 
k n > 0.53029 log 2 ra + 0(1). 

Now we will determine the optimal weights, and see an example to illustrate that close-to- 
optimal weights give a close-to-optimal lower bound. Our optimal weights should be such that 
X\ = x ai = x° 3 = x 3 and x 2 = x° 2 . It follows that the ratio of two weights is 



log (l 



V2 

a 2 x 2 \" 2 



1.77155. 



ax x x log(l/2) 

Hence, we take integer weights a x = a 3 = 4 and a 2 = 7 which give the ratio a 2 /ai = 1.75 ~ 
1.77155. With these weights we obtain 

[x^ ai+a2+a3)n }U(x ai ,x a2 ,x a3 ) = [x 15n }U(x\x 7 ,x 4 ) G 0(13.657") 

and a corresponding lower bound of k n > 0.53028 log 2 n + 0(1), which is virtually indistinguish- 
able from the bound we would expect with optimal weights. 



5.3 Main Lower Bound 

For our main result, we apply the weighted generating function technique to a large set of relations, 
Riq. For simplicity, we assume the weights for mi and m 3 are equal, so x\ = x 3 . We obtain a 
generating function U(xi, x 2 , x 3 ) = p{xi ^ X3) where 

p(x\, £2, %i) :=1 — 2xi — X2 + x\ + 2x\x\ + 2x\x\ + 2x\x\ + hx\x% + ^x\x\ + 14x^2+ 

82^ + 13x^ + Vlx\x% + 22^ + 40x^ + ±\x\x\ + 132a£z£ + 77xfx 6 2 + 
123x?a£ + 134x?a;| + 429z?a;| + 252x?x 2 7 + 248xf x\. 
We wish to maximize -^-^ subject to p(xi, x 2 , Xi) = 0, and obtain optimal weights from the 
solution. The optimal solution is at x\ = 0.47565, x 2 = 0.37405 with objective value j3 := = 
11.817. Then by choosing the right weights we get \G{n, 2)| < |£ n (n, 2) n U\ G 0((/3 + e) n ) for 
arbitrarily small e > 0. For sufficiently small e, this gives us the bound 

k n > 0.561361og 2 n + O(l). 

6 Conclusion 

In summary, we gave a proof of Knuth's lower bound (k n > \ log 2 n + 0(1)) using a counting 
argument that bounded G(n, k). We generalized the counting argument from single stacks to pairs 
of stacks and discovered that many sequences of moves may generate a permutation. We found 
an equivalence relation on strings of moves, and used string rewriting techniques to reduce the 
number of strings, leading to a new lower bound, k n > 0.513 log 2 n + 0(1). 

In Section |5] we discussed techniques for improving the bound using more relations or more 
sophisticated analysis. The combination of these techniques and a set (i?i6) of 1591 relations give 
us our main result, 

k n > 0.561 log 2 n + 0(1). 

This bound can almost certainly be improved by using more relations, and finding the true asymp- 
totic complexity of k n remains an open problem. 
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